Scheduling Human Intelligence Tasks in Multi-Tenant Crowd-Powered Systems
نویسندگان
چکیده
Micro-task crowdsourcing has become a popular approach to e↵ectively tackle complex data management problems such as data linkage, missing values, or schema matching. However, the backend crowdsourced operators of crowd-powered systems typically yield higher latencies than the machineprocessable operators, this is mainly due to inherent efficiency di↵erences between humans and machines. This problem can be further exacerbated by the lack of workers on the target crowdsourcing platform, or when the workers are shared unequally among a number of competing requesters; including the concurrent users from the same organization who execute crowdsourced queries with di↵erent types, priorities and prices. Under such conditions, a crowd-powered system acts mostly as a proxy to the crowdsourcing platform, and hence it is very di cult to provide e ency guarantees to its end-users. Scheduling is the traditional way of tackling such problems in computer science, by prioritizing access to shared resources. In this paper, we propose a new crowdsourcing system architecture that leverages scheduling algorithms to optimize task execution in a shared resources environment, in this case a crowdsourcing platform. Our study aims at assessing the e ciency of the crowd in settings where multiple types of tasks are run concurrently. We present extensive experimental results comparing i) di↵erent multi-tenant crowdsourcing jobs, including a workload derived from real traces, and ii) di↵erent scheduling techniques tested with real crowd workers. Our experimental results show that task scheduling can be leveraged to achieve fairness and reduce query latency in multi-tenant crowd-powered systems, although with very di↵erent tradeo↵s compared to traditional settings not including human factors.
منابع مشابه
Optimization techniques for human computation-enabled data processing systems
Crowdsourced labor markets make it possible to recruit large numbers of people to complete small tasks that are difficult to automate on computers. These marketplaces are increasingly widely used, with projections of over $1 billion being transferred between crowd employers and crowd workers by the end of 2012. While crowdsourcing enables forms of computation that artificial intelligence has no...
متن کاملCrowd-Powered Intelligent Systems
Human computation provides a new resource for creating intelligent systems that are able to go beyond the capabilities of fully-automated solutions. Currently, machines struggle in many real-world settings because problems can be almost entirely unconstrained and can vary greatly between instances. Solving problems such as natural language understanding require artificial intelligence to first ...
متن کاملA Data-driven Method for Crowd Simulation using a Holonification Model
In this paper, we present a data-driven method for crowd simulation with holonification model. With this extra module, the accuracy of simulation will increase and it generates more realistic behaviors of agents. First, we show how to use the concept of holon in crowd simulation and how effective it is. For this reason, we use simple rules for holonification. Using real-world data, we model the...
متن کاملArchitecting Real-Time Crowd-Powered Systems
Human computation allows computer systems to leverage human intelligence in computational processes. While it has primarily been used for tasks that are not time-sensitive, recent systems use crowdsourcing to get on-demand, real-time, and even interactive results. In this paper, we present techniques for building real-time crowdsourcing systems, and then discuss how and when to use them. Our go...
متن کاملWhat's Hot in Crowdsourcing and Human Computation
The focus of HCOMP 2014 was the crowd worker. While crowdsourcing is motivated by the promise of leveraging people’s intelligence and diverse skillsets in computational processes, the human aspects of this workforce are all too often overlooked. Instead, workers are frequently viewed as interchangeable components that can be statistically managed to eek out reasonable outputs. We are quickly mo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016